What Should Markup Really Be? Applying theories of text to the design of markup systems

نویسندگان

  • David G. Durand
  • Elli Mylonas
  • Steven J. DeRose
چکیده

Introduction The issue of what text really is, and how it affects our notions of proper text representation has been with us almost from the beginning of text encoding [Goldfarb 1981, Reid 1980, Coombs, et al. 1987, DeRose, et al. 1990, Renear et al.]. The simplest reasonable view, that text is fundamentally an ordered hierarchical structure, determined by its editor and author, is an early one that has remained prominent, especially as reified by ISO 8879 (SGML). However, this simple model is not enough, which the TEI [Sperberg-McQueen and Burnard 1990,1993] quickly discovered as it moved text encoding from the realm of print production to that of scholarship, textual editing, and linguistic analysis. The TEI metalanguage committee identified problems with SGML’s simple hierarchical mechanisms, and developed and published techniques for working around them to encode non-hierarchical phenomena [Barnard et al. 1996]. In [Renear et al.] we began to analyze and label the theoretical and ontological foundations underlying many of the kinds of non-hierarchical structures discovered by practitioners using naive hierarchical markup. This paper uncovered some key notions and implicit partial theories underlying most previous theorizing about markup. The most important of these notions is the primacy of “analytic perspectives,” which we defined as a “natural family of methodology, theory, and analytical practice.” Perspectives explain various implicit presuppositions of the simple hierarchical approach. In this paper, we use these theoretical results to examine how the basic notions of hierarchical markup should be extended to allow a more expressive and accurate approach to document markup. Most of the following discussion is framed in terms of SGML, because SGML represents the state of the art in document description languages. The features that we propose can be regarded either as sugggestions for improving SGML, as specifications for some future successor, or even specifications for a new standard, that, like HyTime, would add additional power to SGML markup. We do not take a position on these thorny standards issues, concentrating rather on the problems to be addressed. In our examples we will use syntax based on SGML for clarity, but we will diverge from that syntax as necessary (and with explanation).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

مدل سازی شوک های مارک آپ با استفاده از مدل DSGE (مورد ایران)

This paper investigates the effects of markup shocks of domestic and export goods prices on macroeconomic variables by using a Dynamic Stochastic General Equilibrium (DSGE) model for Iran, in order to examine the effect of the growth of market power and monopoly in domestic and exporting markets from a macroeconomic viewpoint. To this end, the optimal pricing process of domestic, importing and ...

متن کامل

Impact of Structural Components of Market on the Markup Level Based on Radial Basis Neural Network and Fuzzy Logic

This paper aims to evaluate the impact of several indices of market structure including entry to barrier, economies of scale and concentration degree on 140 active industries using the digit. Accordingly, we apply three methods including cost disadvantages ratio ( ), Herfindahl–Hirschman concentration index ( ) and Comanor and Willson criterion in order to assess the economies of scale and usin...

متن کامل

Welcome to Markup Languages: Theory & Practice

When embarking on such an uncertain project as the creation of a new journal in what is not exactly a recognized academic field — a print journal, no less, for the discussion of systems for electronic text markup — it is difficult to avoid the temptation to explain the rationale and goals of the journal. We have neither avoided nor attempted to resist that temptation, and this essay is the resu...

متن کامل

Applying Software Analysis Technology to Lightweight Semantic Markup of Document Text

Software analysis techniques, and in particular software “design recovery”, have been highly successful at both technical and businesslevel semantic markup of large scale software systems written in a wide variety of programming languages, and in particular have proven efficient and scalable in assisting the resolution of the “year 2000” problem for billions of lines of legacy source code. In t...

متن کامل

English Teachers Professional Development Needs for Web Development Skills: Meeting the Challenges of Teaching English Language in the Information Age

Utilizing the resources of the web in educational practices has made instructional processes more efficient and interesting and has made the learning process on the other hand much easier and attractive. With the web, English language teachers now have the option of engaging learners in online (web-based) instructions in addition to the use of conventional classroom instructions or alternativel...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996